Synthesizing prosody for commands in a Xhosa TTS system
نویسندگان
چکیده
Xhosa is an African tone language spoken in South Africa. The relationship between the prosodic features duration, pitch and loudness of Xhosa commands was determined through acoustic analysis and perceptual experimentation. Combining the results of the acoustic analysis and the perceptual experiment proved to be an appropriate method of parameter extraction with which to formulate a prosodic model for the generation of perceptually acceptable imperatives in a practical Xhosa TTS system.
منابع مشابه
Data-driven approach to rapid prototyping Xhosa speech synthesis
This paper presents work in progress towards building a Xhosa speech synthesizer. HTS is being used for this purpose due to certain desirable properties. As a minority language, linguistic resources for Xhosa are limited despite a variety of impressionistic phonetic studies, prompting a minimalist approach and a preference for data-driven methods. Xhosa is an agglutinative language, and is also...
متن کاملA Hakka Text-To-Speech System
In this paper, the implementation of a Hakka text-to-speech (TTS) system is presented. The system is designed based on the same principle of developing a Mandarin and a Min-Nan TTS systems proposed previously. It takes 671 base-syllables as basic synthesis units and uses a recurrent neural network (RNN)-based prosody generator to generate proper prosodic parameters for synthesizing natural outp...
متن کاملModeling and Synthesizing Emotional Speech for Catalan Text-to-Speech Synthesis
This paper describes an initial approach to emotional speech synthesis in Catalan based on a diphone concatenation TTS system. The main goal of this work is to develop a simple prosodic model for expressive synthesis. This model is obtained from an emotional speech collection artificially generated by means of a copy-prosody experiment. After validating the emotional content of this collection,...
متن کاملSynthesizing Elaborate Intonation Contours in Text-to-Speech for French
This paper presents a modular TTS system (called MINGUS) which exploits syntactic information contained in the input and allows additional annotation of the input in order to obtain particular intonation contours or to vary most prosodic parameters. This system is based on a tonal representation of French intonation, on a model of the interaction between syntax and prosody, and on a model of th...
متن کاملPerformance Analysis of Text To Speech Synthesis System Using HMM And Prosody Features With Parsing For Tamil Language
This paper describes a Hidden Markov Model (HMM) based (TTS) system and prosody based (TTS) system for producing natural sounding synthetic speech in Tamil language. The (HMM) based system consists of two phases such as training and synthesis. Tamil speech is first parameterized into spectral and excitation features using Glottal Inverse Filtering (GIF). An emotions present in the input text is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000